CDS
Accession Number | TCMCG075C08584 |
gbkey | CDS |
Protein Id | XP_017973885.1 |
Location | join(1745179..1745227,1745344..1745393,1745618..1745650,1746328..1747401,1747704..1748258,1748332..1748442,1748537..1748626,1748714..1748785,1748932..1749042,1749645..1749771,1750814..1751016,1751400..1751552,1752450..1752650) |
Gene | LOC18604054 |
GeneID | 18604054 |
Organism | Theobroma cacao |
Protein
Length | 942aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018118396.1 |
Definition | PREDICTED: DNA mismatch repair protein MSH2 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGATGAAAATTTTGATGAACGAAACAAGCTTCCAGAGCTCAAACTAGATGCTAAGCAGGCTCAAGGGTTTCTCTCTTTCTTCAAAACCCTACCCAATGATGCAAGGGCAGTTCGGTTTTTTGATCGCCGGGATTATTATACTGCTCATGGTGAAAATGCAACCTTTATTGCAAAGACATATTACCGCACTACTACTGCTCTCCGGCAACTGGGTAGCGGCTCTGATGGCCTTTCAAGTGTAACTGTTAGTAAAAACATGTTTGAAACAATTGCTCGTGATCTTCTCCTGGAGAGAACAGACCACACTCTGGAGCTCTATGAAGGCAGTGGCTCCCATTGGAGGTTAATGAAAAGTGGCAGTCCTGGGAATCTGGGCAGTTTTGAAGATGTTCTGTTTGCCAACAATGAGATGCAGGACACACCTGTTGTTGTTGCATTGCTTCCTAACTTCCGTGAAAATGGGTGCACTATTGGGTTCAGTTATGTTGATTTAACGAAGAGGGTACTTGGATTGGCTGAATTTCTTGATGATAGTCACTTTACAAATACAGAGTCGGCTTTGGTTGCTCTCGGTTGCAAGGAATGCCTTTTGCCCATAGAGAGTGGAAAAGCCAGTGAATGTAGAACTCTCAATGATGCTTTGACCAGATGTGGTGTTATGGTAACTGAGAGAAAGAAAACTGAGTTTAAAGCAAGGGATCTGGTTCAGGATCTTGGCAGACTAATCAAAGGTTCCATTGAACCAGTTCGAGACTTGGTTTCTGGATTTGAATTTGCACCTGCTGCTTTAGGAGCCTTACTATCTTATGCAGAACTACTGGCAGATGAAGGCAATTATGGAAATTATAGCATCCGGAGATACAATCTTGGCAGCTACATGAGATTAGATTCTGCTGCTATGAGGGCATTGAATGTCCTAGAAAGCAGAACTGATGCAAACAAAAATTTTAGTTTGTTTGGTCTTATGAATAGAACCTGTACCGCTGGGATGGGTAAGCGGTTGCTTCATATGTGGCTAAAACAGCCTTTGTTAGATGTAAGTGAGATAAACTCAAGGCTAGATTTGGTACAAGCTTTTGTGGAGGATACCGAGCTTCGCCAAGCTTTGAGGCAGCATCTGAAAAGAATTTCAGATATTGAGCGACTTATGCGCAATATTGAAAAGACAAGAGCTGGTTTGCAGCATGTTGTAAAACTTTATCAGTCAAGTATAAGAATTCCCTACATTAAAAGTGCCCTGGAAAAATATGATGGACAGTTTTCATCCTTGATCAAGGAAAGATATTTGGATCCTTTTGAGCTCTTCACTGACGACGATCATTTGAACAAGTTCATTTCTCTTGTTGAAACTTCTGTCGACCTAGATCAACTTGAAAATGGGGAATACATGATTTCACCTAGTTATGATGATGCCCTAGCTGCACTAAAAAATGAGCAGGAGTCACTAGAGCTCCAAATACACAACTTACATAAACAAACTGCTATTGATCTTGATCTGCCAGTAGACAAGGCATTAAAGTTAGATAAGGGCACACAGTTTGGACATGTTTTCAGAATTACAAAGAAAGAAGAGCCAAAAGTAAGAAAAAAGCTCTCCACCCAATTTATTATTCTTGAAACTCGAAAGGATGGAGTAAAATTCACTAGCACAAAGCTTAAAAAGTTGGGGGACCAGTACCAAAAGATACTTGAGGAGTATAAGAACTGTCAAAAAGAACTAGTCAACCGAGTGGTTCAAACTACAGCAACTTTCTCTGAGGTGTTTGAGCCCTTAGCTGGGTTGCTCTCCGAATTGGATGTCTTGCTTAGTTTTGCTGATTTAGCTTCTAGTTGCCCTACCCCATACACAAGACCTGAAATTACTCCAGCGGATGTAGGAGATATTGTATTAGAAGGAAGTAGACATCCCTGTGTGGAGGCGCAAGACTGGGTGAATTTTATACCAAATGATTGTAGACTTGTAAGAGGAAAGAGCTGGTTCCAGATCATCACTGGGCCTAATATGGGTGGAAAATCAACATTCATCCGGCAGGTTGGTGTCAACATTCTGATGGCACAAGTAGGTTCTTTTGTTCCTTGTGAAAAAGCTAGCATTTCTGTCCGAGACTGCATTTTTGCCCGTGTTGGTGCTGGTGACTGCCAACTACGTGGAGTTTCTACCTTTATGCAAGAAATGCTTGAAACTGCATCAATATTGAAAGGAGCTACTGACAAGTCATTGATAATCATTGATGAGTTGGGGCGAGGAACATCAACCTATGATGGATTTGGTTTAGCATGGGCCATATGCGAGCATATTGTTGAAGTGATCAAAGCACCTACTTTGTTCGCTACCCACTTCCATGAACTGACTGCATTAGCTCATGAAAATGTCAATGATGAGCCACAGGCAAAACAGATTGTTGGTGTGGCAAACTATCATGTTAGTGCTCACATTGACTCATCAAGTCGCAAATTGACAATGCTGTACAAGGTTGAGCCAGGTGCCTGTGATCAAAGTTTTGGTATCCATGTAGCAGAATTTGCCAACTTTCCTGAAAGTGTTATATCCCTTGCAAGAGAAAAGGCTGCTGAATTGGAAGATTTCTCGCCAACTTCAATCATTTCCAGTGATGCTAGACAAGAGGAAGGTTCTAAAAGGAAGCGAGAGTGTGATCCTATTGACATGTCTAGAGGTGCTGCAAAGGCTCACAAGTTCTTGAAGGACTTTGCTGATTTGCCATTAGAGTCTATGGACCTGAAGCAGGCTCTGCAACAAGTAAACAAGCTAAGGGGTGACTTAGAAAAGGATGCAGTAAACTGTAACTGGCTCCGGCAATTCCTTTAG |
Protein: MDENFDERNKLPELKLDAKQAQGFLSFFKTLPNDARAVRFFDRRDYYTAHGENATFIAKTYYRTTTALRQLGSGSDGLSSVTVSKNMFETIARDLLLERTDHTLELYEGSGSHWRLMKSGSPGNLGSFEDVLFANNEMQDTPVVVALLPNFRENGCTIGFSYVDLTKRVLGLAEFLDDSHFTNTESALVALGCKECLLPIESGKASECRTLNDALTRCGVMVTERKKTEFKARDLVQDLGRLIKGSIEPVRDLVSGFEFAPAALGALLSYAELLADEGNYGNYSIRRYNLGSYMRLDSAAMRALNVLESRTDANKNFSLFGLMNRTCTAGMGKRLLHMWLKQPLLDVSEINSRLDLVQAFVEDTELRQALRQHLKRISDIERLMRNIEKTRAGLQHVVKLYQSSIRIPYIKSALEKYDGQFSSLIKERYLDPFELFTDDDHLNKFISLVETSVDLDQLENGEYMISPSYDDALAALKNEQESLELQIHNLHKQTAIDLDLPVDKALKLDKGTQFGHVFRITKKEEPKVRKKLSTQFIILETRKDGVKFTSTKLKKLGDQYQKILEEYKNCQKELVNRVVQTTATFSEVFEPLAGLLSELDVLLSFADLASSCPTPYTRPEITPADVGDIVLEGSRHPCVEAQDWVNFIPNDCRLVRGKSWFQIITGPNMGGKSTFIRQVGVNILMAQVGSFVPCEKASISVRDCIFARVGAGDCQLRGVSTFMQEMLETASILKGATDKSLIIIDELGRGTSTYDGFGLAWAICEHIVEVIKAPTLFATHFHELTALAHENVNDEPQAKQIVGVANYHVSAHIDSSSRKLTMLYKVEPGACDQSFGIHVAEFANFPESVISLAREKAAELEDFSPTSIISSDARQEEGSKRKRECDPIDMSRGAAKAHKFLKDFADLPLESMDLKQALQQVNKLRGDLEKDAVNCNWLRQFL |